Byte pair encoding: English Wikipedia

Byte pair encoding
Byte pair encoding〔Philip Gage, ''A New Algorithm for Data Compression''.〕 or digram coding〔Ian H. Witten, Alistair Moffat, and Timothy C. Bell. ''Managing Gigabytes''. New York: Van Nostrand Reinhold, 1994. ISBN 978-0-442-01863-4.〕 is a simple form of data compression in which the most common pair of consecutive bytes of data is replaced with a byte that does not occur within that data. A table of the replacements is required to rebuild the original data. The algorithm was first described publicly by Philip Gage in a February 1994 article, "A New Algorithm for Data Compression", in the ''C Users Journal''.
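A single replacement pass, as described above, might be sketched in Python as follows. This is an illustrative sketch, not Gage's original C implementation: the function names (`most_common_pair`, `unused_byte`, `replace_once`) are hypothetical, pairs are counted with overlaps, ties go to the pair seen first, and a pass is skipped when no pair occurs at least twice.

```python
from collections import Counter


def most_common_pair(data: bytes):
    """Return (pair, count) for the most frequent pair of adjacent bytes, or None."""
    # Overlapping occurrences are counted, so b"aaa" contains b"aa" twice.
    counts = Counter(zip(data, data[1:]))
    if not counts:
        return None
    (a, b), n = counts.most_common(1)[0]
    return bytes([a, b]), n


def unused_byte(data: bytes):
    """Return a byte value that never occurs in data, or None if all 256 are used."""
    present = set(data)
    for value in range(256):
        if value not in present:
            return value
    return None


def replace_once(data: bytes):
    """Swap the most common pair for an unused byte.

    Returns (new_data, (symbol, pair)), or (data, None) when no pass is possible.
    """
    found = most_common_pair(data)
    symbol = unused_byte(data)
    if found is None or symbol is None or found[1] < 2:
        return data, None
    pair, _ = found
    # bytes.replace substitutes non-overlapping occurrences from left to right.
    return data.replace(pair, bytes([symbol])), (symbol, pair)
```

Calling `replace_once(b"aaabdaaabac")` replaces the pair `aa` with the byte value 0, the lowest value absent from the input, and returns the table entry `(0, b"aa")` that a decoder would need.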
==Byte pair encoding example==
Suppose we wanted to encode the data
aaabdaaabac
The byte pair "aa" occurs most often, so it will be replaced by a byte that is not used in the data, "Z". Now we have the following data and replacement table:
ZabdZabac
Z=aa
Then we repeat the process with byte pair "ab", replacing it with Y:
ZYdZYac
Y=ab
Z=aa
We could stop here, as the only literal byte pair left occurs only once.
Or we could continue the process and use recursive byte pair encoding, replacing "ZY" with "X":
XdXac
X=ZY
Y=ab
Z=aa
This data cannot be compressed further by byte pair encoding because there are no pairs of bytes that occur more than once.
To decompress the data, perform the replacements in reverse order: expand X first, then Y, then Z.
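The round trip for this example can be checked with a short Python sketch. The table is copied from the example above. The names `TABLE`, `encode` and `decode` are hypothetical, and the sketch works on text characters rather than raw bytes to keep the symbols Z, Y and X readable.

```python
# Replacement table from the example, in the order the pairs were introduced.
TABLE = [("Z", "aa"), ("Y", "ab"), ("X", "ZY")]


def encode(text, table):
    """Apply each (symbol, pair) replacement in the order given."""
    for symbol, pair in table:
        text = text.replace(pair, symbol)
    return text


def decode(text, table):
    """Undo the replacements, most recent first."""
    for symbol, pair in reversed(table):
        text = text.replace(symbol, pair)
    return text


compressed = encode("aaabdaaabac", TABLE)
restored = decode(compressed, TABLE)
print(compressed)  # XdXac
print(restored)    # aaabdaaabac
```

Decoding has to run in reverse because X expands to ZY, which only becomes literal data once Y and Z have been expanded in turn.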

Excerpt source: Wikipedia, the free encyclopedia